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Abstract. In this paper we introduce and study a new property of in- 
finite words which is invariant under the action of a morphism: We say 
an infinite word x £ A***, defined over a finite alphabet A, is self-shuffling 
if X admits factorizations: x = Yll^i UiVi = Ili^i ~ Ili^i ^» with 
Ui,Vi £ A"*". In other words, there exists a shuffle of x with itself which 
reproduces x. The morphic image of any self-shuffling word is again self- 
shuffling. We prove that many important and well studied words are 
self-shuffling: This includes the Thue-Morse word and all Sturmian words 
(except those of the form aC where a £ {0, f } and C is a characteristic 
Sturmian word). We further establish a number of necessary conditions 
for a word to be self-shuffling, and show that certain other important 
words (including the paper-folding word and infinite Lyndon words) are 
not self-shuffling. In addition to its morphic invariance, which can be used 
to show that one word is not the morphic image of another, this new no- 
tion has other unexpected applications: For instance, as a consequence of 
our characterization of self-shuffling Sturmian words, we recover a num- 
ber theoretic result, originally due to Yasutomi, which characterizes pure 
morphic Sturmian words in the orbit of the characteristic. 



1 Introduction 



Let A be a finite non-empty set. We denote by A* the set of all finite words 
u = X1X2 ■ ■ ■ Xn with Xi G A. The quantity n is called the length of u and 
is denoted The empty word, denoted e, is the unique element in A* with 
|e| = 0. We set A+ = A - {e}. We denote by A^* the set of aU one-sided infinite 
words X = X0X1X2 ■ ■ ■ with Xi G A. 

Given /c-finite or infinite words x'^^^ , x'-^^ , • . • , x^''^ £ A* (J A^ we denote by 

^(a;«,a;(2),...,x('=)) C A*UAN 



the collection of all words z for which there exists a factorization 

oo 
1=0 

with each C/f-''^ e A* and with x^-') = O^^o ^t'^ 1 < J < fc- Intuitively, z may 
be obtained as a shuffle of the words a;^"'^^ , x*^^^ , • • ■ , a;'^'"'^ . In case x'^^^ , x*^^^ , • • • , a;^*^-* G 
A* , each of the above products can be taken to be finite. 

Finite word shuffles were extensively studied in [5]. Given a; e A*, it is gen- 
erally a difficult problem to determine whether there exists y e A* such that 
X S S''{y,y) (see Open Problem 4 in [5]). However, in the context of infinite 
words, this question is essentially trivial: In fact, it is readily verified that if 
X € A^ is such that each a G A occurring in x occurs an infinite number of times 
in X, then there exist infinitely many y e A^ with x G -^{y, y)- Instead, in the 
framework of infinite words, a far more delicate question is the following: 

Question 1. Given x G A^, docs there exist an integer k > 2 such that x G 
fe 

If such a k exists, we say x is k-self-shuffting. 

Given x = xqXiX2 ■ ■ ■ G A^ and an infinite subset N = {Nq < Ni < N2 < 
. . .} C N, we put x[N] — xmqXNiXn2 ■ ■ ■ G A'^. Alternatively, 

Definition 1. For x G A^ and fc = 2, 3, . . ., we say x is k-self-shuffting if there 
exists a fc-element partition N = lJi=i with a;[A^*] = x for each i = 1, . . . , k. 

In case k = 2, we say simply x is self- shuffling. We note that if x is /c-self- 
shuffling, then x is €-self-shufHing for each £ > k but not conversely (see fQ]), 
whence each self-shuffling word is fc-self-shuffling for all fc > 2. In this paper 
we are primarily interested in self-shuffling words, however, many of the results 
presented here extend to general fc. Thus x G A^ is self-shuffling if and only if x 
admits factorizations 

00 00 00 

a: - n = n = n 

2—1 2—1 2 — 1 

with U^,V^eA+. 

The property of being self-shuffling is an intrinsic property of the word (and 
not of the associated language) and seems largely independent of its complexity 
(examples exist from the lowest to the highest possible complexity) . The simplest 
class of self-shuffling words consists of all (purely) periodic words a; = w"^. It is 
clear that if x is self-shuffling, then every letter a G A occurring in x must occur 
an infinite number of times. Thus for instance, the ultimately periodic word 01" 
is not self-shuffling. As we shall see, many well-known words which are of interest 
in both combinatorics on words and symbolic dynamics, are self-shuffling. This 
includes for instance the famous Thue-Morse word 



T = 0110100110010110100101100110100110010110 



whose origins go back to the beginning of the last century with the works of 
the Norwegian mathematician Axel Thue |12I13| . The nth entry tn of T is de- 
fined as the sum modulo 2 of the digits in the binary expansion of n. While 
the Thue-Morse word appears naturally in many different areas of mathematics 
(from discrete mathematics to number theory to differential geometry-see [1] or 
[5]), proving that Thue-Morse is self-shuffling is somewhat more involved than 
expected. 

Sturmian words constitute another important class of aperiodic self-shufhing 
words. Sturmian words are infinite words over a binary alphabet having exactly 
n+1 factors of length n for each n > 8 . Their origin can be traced back to the 
astronomer J. Bernoulli III in 1772. They arise naturally in many different areas 
of mathematics including combinatorics, algebra, number theory, ergodic theory, 
dynamical systems and differential equations. Sturmian words are also of great 
importance in theoretical physics and in theoretical computer science and are 
used in computer graphics as digital approximation of straight lines. We show 
that all Sturmian words are self-shuffling except those of the form aC where 
a G {0, 1} and C is a characteristic Sturmian word. Thus for every irrational 
number a, all (uncountably many) Sturmian words of slope a are self-shuffling 
except for two. Our proof relies on a geometric characterization of Sturmian 
words via irrational rotations on the circle. 

So while there are many natural examples of aperiodic self-shuffling words, 
the property of being self-shuffling is nevertheless quite restrictive. We obtain 
a number of necessary (and in some cases sufficient) conditions for a word to 
be self-shuffling. For instance, if a word x is self-shuffling, then x begins in only 
finitely many Abelian border-free words. As an application of this we show that 
the well-known paper folding word is not self-shuffling. Infinite Lyndon words 
(i.e., infinite words which are lexicographically smaller than each of its suffixes) 
are also shown not to be self-shuffling. 

One important feature of self-shuffling words stems from its invariance under 
the action of a morphism: The morphic image of a self-shuffiing word is again self- 
shuffling. In some instances this provides a useful tool for showing that one word 
is not the morphic image of another. So for instance, the paper folding word is not 
the morphic image of any self-shuffling word. However this application requires 
knowing a priori whether a given word is or is not self-shuffling. In general, 
to show that a word is self-shuffling, one must actually exhibit a shuffle. Self- 
shuffling words have other unexpected applications particularly in the study of 
fixed points of substitutions. For instance, as an almost immediate consequence 
of our characterization of self-shuffling Sturmian words, we recover a result, first 
proved by Yasutomi via number theoretic methods, which characterizes pure 
morphic Sturmian words in the orbit of the characteristic. 

2 Examples &z Non-examples 

In this section we list some examples and non-examples of self-shuffling words. 
As usual in combinatorics on words, we follow notation from [8]. 



Fibonacci word: The Fibonacci infinite word 

X = 0100101001001010010100 . . . 

is defined as tlie fixed point of the morphism ip given by 01, 1 n- 0. It 
is readily verified that 93^(0) = (p{a)a for each a e {0,1}. Whence, writing 
X — X0X1X2 ■ ■ ■ with each Xi G {0, 1} we obtain 

X = X0X1X2 ... = (p{xo)(p{xi)(p{x2) . . . = (p'^{xn)ip'^{xi)ip'^{x2) ...= 

= ip{xQ)xo(p{xi)xiip{x2)x2 ■ ■ . 

which shows that x is self-shuffling. In contrast, the word y = Oa; is not self- 
shuffling. The word y starts with infinitely many prefixes of the form 0i?l with 
B a palindrome. It follows that 0i?l is Abelian border-free (i.e., no proper suffix 
of 0-Bl is Abelian equivalent to a proper prefix of U). By Proposition [3] the word 
y is not self-shuffling. 

Paper-folding word: The paper-folding word 

X ^ 00100110001101100010 . . . 

is a Toeplitz word generated by the pattern u — 0?1? (see[4]). It is readily ver- 
ified that X begins in arbitrarily long Abelian border-free words and hence by 
Proposition [3] is not self-shuffling. More precisely, the prefixes Uj of x of length 
rij = 2^ — 1 are Abelian border- free. Indeed, it is verified that for each k < rij, we 
have |prefj,(uj)|o > k/2 while |sufffc(uj)|o < k/2. Here prefj,(u) (resp., sufffc(it)) 
denotes the prefix (resp., suffix) of length fc of a word u, an \u\a is the number 
of occurrences of a letter a in u. 

A 3-self-shuffling word which is not self-shuffling: Let y denote the fixed 
point of the morphism cr : 1— > 0001 and 1 0101, and put 

X = Q-^y = 01000100010101000100010001010100010001000101010001010100. . . 

Then for each prefix uj of x of length 4^ —2, the longest Abelian border of Uj of 
length less than or equal to (4^ — 2)/2 has length 2. Hence x is not self-shuffling 
(see Proposition [3]). The 3-shuffle is given by the following: 



Uo = 


0100, (7i 


= 01, 


■ , U4i+2 = 


e, 


U4i+3 = 


a +'(0100), 








U4i+4 = 


a(0). 


U4i+5 = 


(a(0))-V'+i(01), 


Vo = 


0100, Vi ■- 


= 01, 


. , V4i+2 = 


(a(0))-V- 


•-^(0), Vm+3 = 


a(0). 








V4i+4 = 


(a(0))-V- 


^'(Ol)a(O), V4.+5 = 


e, 


Wo = 


= 01, Wi 




. , W4i+2 = 


= e, 


W4i+3 = 


= (a(0))-V^+'(01). 








W4i+4 = 


= e, 


W4i+5 = 


= cr*+'(0)a(0). 



It is then verified that 

X = n ^^'^^^^ = n = n = n 

i=0 i=0 i=0 i=0 



from which it follows that x is 3-self-shufiling. 



A recurrent binary self-shuffling word with full complexity: For each 
positive integer n, let Zn denote the concatenation of all words of length n in 
increasing lexicographic order. For example, Z2 = 00011011. For i > put 




Zn, if i = n2" ^ for some n, 
0*1% otherwise, 



and define 

x = Y[Xi = OlOlOOllO^Oll^O'^OlO^l^Oll'^ . . . , 

i=0 

where Xq = Xi = 01, X2 = 0011, and for i > 3, X, = 0*yi_2l% where yi-2 = 
yi-3Vi-2yi-3, and yo = £■ We note that x is recurrent and has full complexity 
(since it contains Zn as a factor for every n). 

To show that the word x is sclf-shufHing, we first show that G ,5^{Xi,Xi). 
Take N = {0, . . . , i- l,i + 1, . . . , 2* 2* + 2'+^ -i- 1}, where w|i de- 

notes the set of positions j of a word u in which the j'-th letter Uj of u is equal to 
1. Then it is straightforward to see that Xj = Xi+i[N] = Xi+i[{l, . . . , 2'+'^}\N]. 
The self-shuffle of x is built in a natural way concatenating shuffles of Xi starting 
with Uo = Vo = 01, so that Xq . . . Xi+i e ^{Xq . . . X^, Xq . . . X,). 

3 General Properties 

In this section we develop several fundamental properties of self- shuffling words. 
The next two propositions show the invariance of self-shuffling words with respect 
to the action of a morphism: 

Proposition 1. Let A and B be finite non-empty sets and r : A — >• B* a mor- 
phism. If X G is self-shuffling, then so is t{x) e B^. 

Proof. If X G y{x,x), then we can write x = Hi^i ^i^i ^ Ili^i ^* = Ili^i 
Whence t{x) = UZi riU.V,) = UZi riUMVi) = HZi ^(C^O = n,=i riV,) as 
required. 

Proposition 2. Let r : A — >• A* be a morphism, and x G A^ be a fixed point of 

T. 

1. Let u he a prefix of x and k be a positive integer such that r'^ia) begins in u 
for each a € A. Then if x is self-shuffling, then so is u^^x. 

2. Let u G A*, and let k be a positive integer such that r*^ (a) ends in u for each 
a G A. Then if x is self-shuffling, then so is ux. 

Proof. We prove only item(l) since the proof of (2) is essentially identical. Sup- 
pose X = rii^i UiVi = rii^i — rii^i Then by assumption, for each i>l 



we can write r^{U^) = uU^ and r^{V,) - uV^ for some V/ € A*. Put X, = U[u 
and Yi = V/u. Then since 



oo 



oc 



oo 



oc 



X = 



1=1 i=l 



we deduce that 



oo oo 



oo 



u 



1 



X — 



1=1 1=1 



i=l 



Corollary 1. Let t : A ^ A* be a primitive morphism, and a £ A. Suppose 
r(6) begins in a (respectively ends in a) for each letter b G A. Suppose further 
that the fixed point (a) is self- shuffling. Then every forward shift (respectively 
backward shift) of T°°{a) is self- shuffling. 

Remark 1. Since the Fibonacci word is self-shufHing and is fixed by the primitive 
morphism i-> 01, 1 i— > 0, it follows from Corollary [1] that every tail of the 
Fibonacci word is sclf-shuffling. 

There are a number of necessary conditions that a sclf-shuffling word must sat- 
isfy, which may be used to deduce that a given word is not self shuffling. For 
instance: 

Proposition 3. If x € is self-shuffling, then for each positive integer N 
there exists a positive integer M such that every prefix u of x with \u\ > M has 
an Abelian border v with \u\/2 > \v\ > N. In particular, x must begin in only a 
finite number of Abelian border-free words. 

Proof. Suppose to the contrary that there exist factorizations x = Ili^o ^i^i ~ 
YliLo ^» ~ rii^o ^» with Ui,Vi e A+ , and there exists N such that for every M 
there exists a prefix u oi x with |u| > M which has no Abelian borders of length 
between N and \u\/2. Take M = \ Ui=oUtVi\ and a prefix u satisfying these 
conditions. Then there exist non-empty proper prefixes U' and V' of u such that 
u e S^{U', V) with \U'l \V'\ > N. Writing u = U'U" it follows that U" and V 
are Abelian equivalent. This contradicts that u has no Abelian borders of length 
between N and |u|/2. 

An extension of this argument gives both a necessary and sufficient condition 
for self-shufHing in terms of Abelian borders (which is however difficult to check 
in practice). For w G A* let ^{u) denote the Parikh vector of u, i. e., W^u) = 
(|u|a)agA, where \u\a denotes the number of occurrences of a in u. 

Definition 2. Given x G A^, we define a directed graph Gx — {Vx, Ex) with 
vertex set 



Vx = {(n,m) G I '^{wci^x) + <f'(pref„a;) = if (pref„+„a;)} 



and the edge set 

= { ((n,m), (n',m')) £ K x K | 

n' = n + 1 and m! — m or m' = rn + 1 and n' ^ n}. 

We say that Gx connects to oo if there exists an infinite path rij^iC^-ii '^j) 
in Gx such that (no, mo) = (0, 0) and nj, ruj ^> oo as j — oo. 

Theorem 1. A wordx 6 is self- shuffling if and only if the graph Gx connects 
to oo. 

The theorem gives a constructive necessary and sufficient condition for self- 
shuffling since a path to infinity defines a self-shuffle. 

As we shall now see, lexicographically extremal words are never self-shuffling. 
Let (A, <) be a finite linearly ordered set. Then < induces a partial ordering <iex 
on A+ and A^ defined as follows: If u, w G A+ (or A^) we write u <iox v if either 
It = u or if M is lexicographically smaller than v. In the latter case we write 
u <iox V- If w is a proper prefix of v then u and v are not comparable. 

Let X e A'^. a factor u of a; is called minimal (in x) if u <iex v for all factors 
V of X with \v\ = \u\. An infinite word y in the shift orbit closure Sx of x is called 
Lyndon (in Sx) if every prefix of y is minimal in x. The proof of the following 
result is omitted for space considerations: 

Theorem 2. Let {A, <) be a linearly ordered finite set and let x G A^\ Let 
y,z ^ Sx with y Lyndon and aperiodic. Then for each w £ S^{y,z), we have 
w <icx z. In particular, taking z — y we deduce that y is not self- shuffling. 

Let A be a finite non-empty set. We say x £ A'^ is extremal if there exists a 
linear ordering < on A with respect to which x is Lyndon. As an immediate 
consequence of Theorem [2] we obtain: 

Corollary 2. Let A be a finite non-empty set and x £ A'^ be an aperiodic ex- 
tremal word. Then x is not self-.shuffl,ing. 

Remark 2. Let x = IIOIOOIIOOIOIIOIOOIOIIO . . . denote the first shift of the 
Thue-Morse infinite word. It is well known that x is extremal and hence is not 
self-shuffling; yet it can be verified that x begins in only a finite number of 
Abelian border-free words. 

4 The Thue-Morse Word Is Self-Shuffling 

Theorem 3. The Thue-Morse word T ^ 011010011001 . . . fixed by the mor- 
phism T mapping i— > Of and I t-> 10 is self- shuffling. 

Proof. For u £ {0, 1}* we denote by u the word obtained from u by exchanging 
Os and Is. Let a : {1, 2, 3, 4} {I, 2, 3, 4}* be the morphism defined by 



cr(I) = 12, cr(2)=31, ct(3)=34, a(4) = 13. 



Set u = 01101 and v = 001; note that uv is a prefix of T. Also define morphisms 
5, /i: {1,2, 3, 4} ^{0,1}* by 

fif(l) = vu, g{2) = vu, g{3) = vu, ^(4) = vu 

and 

h{l) = uv, h{2) = uv, h{3) = uv, /i(4) = uv 
We will make use of the following lemmas: 

Lemma 1. g{a{a)) G y{g{a),h{a)) for eacha G {1, 2, 2), 4^}. In particular ug{(j{l)) G 
S'iug{l),h{l)). 

Proof. For a = 1 we note that 

g{<7{l)) = g{12) = vuvu = 0011001011010010. 
Factoring 0011001011010010 = • Oil • • 010 • 11 • 01 • 0010 we obtain 

g{(T{l)) G =5^(00110010, 01101001) = y{vu, uv) = ^(ff(l), h{l)). 

Similarly, for a = 2 we have 

gia{2)) = g(31) = vuvu = 1100110100110010. 
Factoring 1100110100110010 = 1 • 100 • 1 ■ 1 ■ 010 ■ 0110 ■ 010 wc obtain 

g{(j{2)) G ^(11010010, 10010110) = S^{vu, uv) = y{g{2), h{2)). 

Exchanging Os and Is in the previous two shufHes yields 

g{a{3)) = 5(34) = vuvu G y{vu,uv) = y{g{3),h{3)) 

and 

g{a{A)) = 5(13) = vuvu G y{vu,uv) = y{g{4),h{A)). 
It is readily verified that 

Lemma 2. h{a{aj) = T{h{a)) for each a G {1,2,3,4}. 

Let w = WQW1W2W3 . . . with Wi G {1, 2, 3, 4} denote the fixed point of cr beginning 
in 1. As a consequence of the previous lemma we deduce that 

Lemma 3. T = h{w). 

Proof. In fact T{h{w)) = h{a{w)) = h{w) from which it follows that h{w) is one 
of the two fixed points of r. Since h{w) begins in h{l) which in turn begins in 
0, it follows that T = h{w). 

Lemma 4. T = ug{w). 



Proof. It is readily verified that: 



ug{l) - h{l)u 

ug{2) = h{i)u 
ug{i) — h{3)u 
ug{4) = h{4)u. 

Moreover, each occurrence of g{l) and g(4) in ug{w) is preceded by u while 
each occurrence of g{2) and 5(8) in ug{w) is preceded by u. It follows that 
ug{w) — h{w) which by the preceding lemma equals T. 

Set 

^0 = ug{(T{wo)) and Ai = g{(T{wi)), for i > 1 
Bq = ug{wo)) and Bi — g{wi)), for i > 1 

and 

d — h{wi) for i > 0. 

It follows from Lemma 12] and Lemma H] that 

00 00 00 

T = i[A, = i[B, = i[a 

i=0 i=0 1=0 

and it follows from Lemma [T] that Ai e ,5^{Bi,Ci) for each i > 0. Hence T G 
y{T,T) as required. 

5 Self-Shuffling Sturmian Words 

In this section we characterize self-shuffling Sturmian words. Sturmian words 
admit various types of characterizations of geometric and combinatorial nature, 
e. g., they can be defined via balance, complexity, morphisms, etc. (see 8J. In 
Hedlund and Morse showed that each Sturmian word may be realized geo- 
metrically by an irrational rotation on the circle. More precisely, every Sturmian 
word X is obtained by coding the symbolic orbit of a point p{x) on the circle (of 
circumference one) under a rotation by an irrational angle a where the circle is 
partitioned into two complementary intervals, one of length a (labeled 1) and 
the other of length 1 — a (labeled 0) (see Figured]). And conversely each such 
coding gives rise to a Sturmian word. The irrational a is called the slope and the 
point p{x) is called the intercept of the Sturmian word x. A Sturmian word x of 
slope a with p{x) = a is called a characteristic Sturmian word. It is well known 
that every prefix m of a characteristic Sturmian word is left special, i.e., both Ow 
and lu are factors of a; [8 . Thus if a; is a characteristic Sturmian word of slope 
a, then both Ox and Ix are Sturmian words of slope a and p{Qx) = p{\x) = 0. 
The fact that p is not one-to-one stems from the ambiguity of the coding of the 
boundary points and 1 — a. 




Fig. 1. Geometric picture of a Sturmian word of slope a. 



Theorem 4. Let S, M and L be Sturmian words of the same slope a, < a < 1, 
satisfying S <iex M <iex L. Then M £ J>^(S,L) if and only if the following 
conditions hold: If p{M) ~ p{S) (respectively, p{M) — p{L)), then p{L) ^ 
(respectively p{S) ^ 0). 

In particular (taking S = M = L), we obtain 

Corollary 3. A Sturmian word x € {0, 1}^ is self-shuffling if and only if p{x) ^ 
0, or equivalently, x is not of the form aC where a S {0, 1} and C is a charac- 
teristic Sturmian word. 

Our proof explicitly describes an algorithm for shuffling S and L so as to pro- 
duce M. It is formulated in terms of the circle rotation description of Sturmian 
words. Geometrically speaking, points p{S) and p{L) will take turns following 
the trajectory of p{M) so that the respective codings agree; as one follows the 
other waits its turn (remains neutral) . The algorithm specifies this following rule 
depending on the relative positions of the trajectories of all three points and is 
broken down into several cases. The proof can be summarized by the directed 
graph in Figure [5] in which each state n corresponds to "case n" in the proof. 




4 j f 5 

Fig. 2. Graphical depiction of the proof of Theorem [l] 



We denote by s, m, and £ the current tail of the words S, M, and L. They 
are initialized as 

s := S, I := L, and m := M. 



While m is always a tail of Af, the letters s and i may be tails of S or L, 
depending on which is the current lexicographically largeslQ. Each directed edge 
corresponds to a precise set of instructions which specify which of s or f is 
neutral, which of s or £ follows m and for how long, and in the end a possible 
relabeling of the variables s and i. In each case the outcome leads to a new case 
in which there is a switch in the follower. In other words, if there is an edge from 
case i to case j in the graph, then either the instructions for case i and case j 
specify different followers (as is the case for cases 1.1 and 2.1) in which case the 
passage from i to j leaves the labeling of s and £ unchanged, or the instructions 
for case i and case j specify the same follower (as is the case for cases 1.2 and 
1.1) in which case the passage from i to j exchanges the labeling of s and £. 
The proof of Theorem [3] amounts to showing that for each state n in the graph, 
the specified instructions will take n to an adjacent state in the graph. 

As an almost immediate application of Corollary [3] we recover the following 
result originally proved by Yasutomi in [14 and later reproved by Berthe, Ei, 
Ito and Rao in 3 and independently by Fagnot in [6 . We say an infinite word is 
pure morphic if it is a fixed point of some morphism different from the identity. 

Theorem 5 (Yasutomi [14j). Let x S {0,1}^ he a characteristic Sturmian 
word. If y is a pure morphic word in the orbit of x, then y 6 {x, Oa:, Ix, Q\x, lOx}. 

Proof. We begin with some preliminary observations. Let n{x) denote the set of 
all left and right infinite words y such that J^{x) — J^{y) where J-{x) and J'{y) 
denote the set of all factors of x and y respectively. If y G n{x) is a right infinite 
word, and Oy, ly G i^{x), then y = x. This is because every prefix of y is a left 
special factor and hence also a prefix of the characteristic word x. Similarly if y 
is a left infinite word and yO, yl G i^{x), then y is equal to the reversal of x. If 
T is a morphism fixing some point y € fi{x), then t(z) G f2{x) for all z G f2{x). 

Suppose to the contrary that t ^ id is a morphism fixing a proper tail y of x. 
Then y is self-shuffling by Corollary [31 Put x ^ uy with u G {0, 1}+. Using the 
characterization of Sturmian morphisms (see Theorem 2.3.7 & Lemma 2.3.13 
in [5]) we deduce that t must be primitive. Thus we can assume that |r(a)| > 
1 for each a G {0, 1}. If r(0) and t(1) end in distinct letters, then as both 
Ot{x), 1t{x) G f^{x), it follows that t(x) = x. Since also T{y) = y and \t{u)\ > 

it follows that y is a proper tail of itself, a contradiction since x is aperiodic. 
Thus t(0) and t(1) must end in the same letter. Whence by Corollary[T]it follows 
that every backward extension of y is self-shuffiing, which is again a contradiction 
since Ox and la; are not self-shuffling. 

Next suppose r ^ id is a morphism fixing a point y = uabx G f2{x) where u G 
{0, 1}+ and {a,b} = {0, 1}. Again we can suppose t is primitive and |t(0)| > 1 
and |t(1)| > 1. If t(0) and r(l) begin in distinct letters, then T(i)0, T(i)l G f2{x) 
where x denotes the reverse of x. Thus t{x) = x. Thus for each prefix v of abx 
we have t{xv) = xt{v) whence t(v) is also a prefix of abx. Hence T(abx) = abx. 

® The choice of the letter s,m, and £ is intended to refer to small, medium, and large 
respectively. 



As before this implies that abx is a proper tail of itself which is a contradiction. 
Thus t(0) and r(l) begin in the same letter. Whence by Corollary [T] it follows 
that every tail of y is self-shuffling, which is again a contradiction since Ox and 
la; are not self-shuffling. 

Remark 3. In the case of the Fibonacci infinite word a;, each of {x, Ox, Ix, Olx, lOx} 
is pure morphic. For a general characteristic word x, since every point in the or- 
bit of X except for Ox and Ix is self-shuffling, it follows that if r is a morphism 
fixing X (respectively Olx or lOx), then t(0) and t(1) must end (respectively 
begin) in distinct letters. 
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6 Appendix 

6.1 Proof of Theorem E 

We state and prove the theorem in greater generahty for /c-self-shufhe. 

Definition 3. For x G A'^ and integer k > 2, define a directed graph Gj; 
(Vj:, E^) with the vertex set 

^|=i'f (pref.^.a^) = <f'(pref,^+...+,^x)}, 

and the edge set 

*i ^ * j for J = 1 , • • ■ , and i j + 1 = } . 



We say that connects to oo if there exists an infinite path v^v^v^ ■ ■ ■ in Gj 
such that = (0, • • • ,0) and w" — >■ oo as n — >■ oo for any j = 1, • • • , fc, where 



V 



Theorem 6. An infinite sequence x £ is k-self-shuffting if and only if the 
graph Gji connects to oc. 

Proof. If X = XqXi ■ ■ ■ with the Xi £ Ais fc-self-shuffiing, then there exist infinite 
subsets C N for j = 1, . . . , fc, such that x[N3] = x. Let = {N^ < < 
NI < ...} (j = l,...,fc). For any n € N, let t{j,n) be m G N such that 
^m-i < n < Nl^ for any j — 1, . . . , fc. Then, pref„x consists of prefj(j „)a;[A^-'] 
with J = 1, . . . , fc in Abehan sense. Since 

preft(j,„)a;[iV^] = preft(j „)X (j = 1, . . . , fc), 

we have 

fc 

^<f(pref,^.a;) = tZ/(pref„x) 
i=i 

with ij = t{j, n) and ii + . . . + ik — n. Therefore, (ii, . . . , ifc) £ . Let n ^ 
for some Z = 1, . . . , fc and Xn = b <E A, then we have 



:= n + 1) 

and 



t(j, n) + 1 for J = Z 
t{j, n) for j 7^ Z ' 



tf'(pref,,)x[iV^]) = 



r tf/(pref,^x[iVJ]) + 66 for j = / 
{^'(pref.^xiiVJ]) forj^/' 

where Cf, is a unit vector having 1 in a coordinate corresponding to the letter h. 
Moreover, since iZ'(pref„_|^;^a;) — !f'(pref„x) + Cb, we have 



^j=if (pref.-x) = If (pref„+ia:). 



Thus, {{ii, . . . , ik), {i'l, • ■ ■ , i'j,)) £ Ej^. Since this holds for any n G N and each 
TV' is an infinite set, Gj; connects to oo. 

Conversely, assume that connects to oo. Let v'^v^v'^ ... be an infinite 
path in such that d° = (0, . . . , 0) and i->- oo as n i->- oo for any j = 1, . . . ,k, 
where = (w", . . . , w^') e V^. Define 

= {n e N; > i;"} for j = 1, . . . , k. 

Then the sets give a partition of N. Let = {N^ < N( < < . . .} for 
j = 1, . . . , fc. For any I = 1, . . . , fc and m € N, let n = Nj;^, ij — t{j, n) and 
i'^ = t{j, n + 1) for j = 1, . . . , fc. Then, = (ii, . . . , i^) and = (i^, . . . , i',^). 
Moreover, ii — m and = rn + 1. Since S -E^, we have 

k 

!^(pref™+ix) - f (prefix) = ^(•Z'(pref,, 2;) - f (pref.^ a;)) 
= '^(pref„+ix) - If (pref„x). 

Hence, Xm = Xn — x[N'']m- Thus, the sets satisfy Definition 1, so we have a 
A;-self-shuffle. 

6.2 Proof of Theorem [2] 

Here we sketch the proof of Theorem O 

Proof. We will make use of the following four lemmas whose proofs are omitted: 

Lemma 5. Let u — U1U2 • • • it„ he a factor of x with each Ui minimal in x. Then 
u is minimal in x. 

A word u G A+ is called bordered, if there exists an integer fc, < fc < |m| 
such that pref j,M — sufFfcU. Otherwise u is called unbordered. 

Lemma 6. Let u be a minimal factor of x and let v be the longest unbordered 
prefix of u. Then v is a period of u, i.e., u is a prefix of w" for some positive 
integer n. 

Lemma 7. Let u and v be factors of x E with u minimal. Then either 
uv <icx V or else v is minimal. 

Let C{x) denote the set of all factors v of x with the property that no suffix of 
V (including v itself) is minimal in x. 

Lemma 8. Let u and v be factors of x with u minimal in x and v G C{x). Let 
s G ^{u, v). Then either s — vu or s <iex v. 



We first consider the case wiiere z — y, i.e, when w 6 y{y,y). Set 



w = n 

1=0 

where 

oo oo 
i=0 i=0 

with each Ui and Vi non-empty. Since y is aperiodic and Lyndon, it foUows 
from Lemma [S] that y contains arbitrarily long unbordered prefixes. Let v be an 
unbordered prefix of y with |?;| > \Uo\. Writing v = Uqv' , since v is unbordered, 
v' G C{x). Let s be such that Uqs is a prefix of w and \s\ = \v'\. Then by Lemma[5] 
we deduce that s <icx v' whence Uqs <icx v and hence w <icx J/- 
Next suppose z ^ y. Let 

oo 

where 

oo oo 

y^l[U, and z^Y[Vi 

with each Ui and Vi non-empty except for possibly Uq. Suppose first that Uq 
is non-empty, that is to say y dishes out the initial segment of w. Let r be 
the shortest non-minimal prefix of z. Then by Lemma [S]r e C{x). Let u be a 
prefix of y longer than \Uo\. Let t denote the prefix of w of length |r|; then by 
Lemma |8l t <icx r and hence w <icx z. Finally suppose that Uq is empty, so 
that z dishes out the initial segment of w. Let z' be a tail of z so that z — Vqz'. 
Writing w — Vqw' , it follows that w' is a shuffie of z' and y in which y dishes 
out the initial segment of z' . li z' ^ y then we are done by the preceding case, 
i.e, w' <iox z' whence w <icx z. If z' ~ y, then as we saw in the beginning of the 
proof, we again have w' <icx y — z' whence w <iox z. 

6.3 Proof of Theorem H 

We begin by showing that the conditions stated in Theorem |3] are in fact nec- 
essary for M G S^{S,L). To see this, suppose p{M) = p{S) and p[L) = (the 
other symmetric condition works analogously). This implies that L G {Ox, la;} 
where x is the characteristic Sturmian word of slope a. li L — Qx, then as Oa; 
is minimal in the Sturmian subshift of slope a, it follows that S — M — L. 
Whence by Corollary d M i y{M, M) = ,9'{S, L).U L = Ix, we consider the 
lexicographic order induced by > 1. Then L <iex M <icx S and moreover L 
is minimal. Since p{M) = p{S) we have that either case i) M — S or case ii) 
S = Ox and M = la; or case iii) there exists u G {0, 1}* such that S = uOlx 
and M = uWx where in each case x denotes the characteristic Sturmian word of 
slope a. In case i), using Theorem [5] we deduce that each element of y{S, L) is 



lexicographically smaller than S and hence since M — S we have M ^ ^{S, L). 
In case ii), if M g S^{S,L), then x 6 S'{x,Ox) which contradicts Theorem [31 
Finally, in case iii), suppose to the contrary that M € S^{S,L). Then since uO 
is not a prefix of M, it follows that there exists a non-empty prefix v of L and a 
prefix w of M such that \w\ = \uO\ + \v\ and M' e .y{L', Ix) where M' and L' 
are defined by M = wM' and L = t;L'. But this implies that M' = L', whence 
L' e ^{L' , Ix) which again contradicts Theorem [51 

We next show that the conditions stated in TheoremHlare sufficient. Without 
loss of generality we can assume that < a < 1/2. 

Our proof explicitly describes an algorithm for shuffling S and L so as to pro- 
duce M. It is formulated in terms of the circle rotation description of Sturmian 
words. Geometrically speaking, points p{S) and p{L) will take turns following 
the trajectory of p{M) so that the respective codings agree; as one follows the 
other waits its turn (remains neutral) . The algorithm specifies this following rule 
depending on the relative positions of the trajectories of all three points and is 
broken down into several cases. The proof can be summarized by the directed 
graph in Figure [H in which each state n corresponds to "case n" in the proof. 

The states : We denote by s, to, and £ the current tail of the words S, M, and 
L. They are initialized as 

s := S, i := L, and to := M. 

While to is always a tail of M, the letters s and £ may be tails of S' or L, depending 
on which is the current lexicographically largest. Each state, or case in the proof, 
is described by a figure depicting the relative positions of p{s),p{m) and p(£), 
which for the sake of simplicity, are actually labeled s,to and £ respectively. If 
X € {s, TO, £} is depicted inside the interval (0, 1 — a) (resp. (1 — a, 1)), then this 
implies that the first letter of the coding of a; is (respectively 1). Moreover the 
endpoints of the partition interval are regarded as both open and closed. For 
example, even if s and to are both depicted in the interval (0, 1 — a), it could be 
that p{s) — and p(to) — 1 — a. In the same way, even if s and to are depicted in 
distinct intervals of the circle partition, it could be that p{s) = p{m). In addition 
to their relative positions on the circle, each state lists a set of relations which 
the variables s, m and £ must satisfy. These conditions are written to the right 
of the circle picture and are described in terms of the following predicates: 

C{s,m,£) = [p{m) = p{s) and p{£) — 0] or [p{m) = p{£) and p{s) — 0]; 
Pi(s, TO, £) = p{s) = a and p{m) — and p{£) = 1 — a; 
P2{s,m,£) = [{p{£) — p(to)) mod I — a and p(s) = 1 — a] or p(m) = 0. 

All states, except those labeled 4 and 5, can be taken to be initial states. 

The edges : Each directed edge corresponds to a precise set of instructions 
which specify which of s or ^ is neutral, which of s or ^ follows m and for how 
long, and in the end a possible relabeling of the variables s and £. In each case 
the outcome leads to a new case in which there is a switch in the follower. In 



other words, if there is an edge from case i to case j in the graph, then either 
the instructions for case i and case j specify different fohowers (as is the case 
for cases 1.1 and 2.1) in which case the passage from i to j leaves the labeling 
of s and I unchanged, or the instructions for case i and case j specify the same 
follower (as is the case for cases 1.2 and 1.1) in which case the passage from i to 
j exchanges the labeling of s and t. 

The proof of Theorem 2] amounts to showing that for each state n in the graph, 
the specified instructions will take n to an adjacent state in the graph. 

What follows is a complete listing of all ten cases with their respective set of 
instructions. 

Case 1.1: 




s <iox rn <icx ^C{s, m, £) 



Instruction: s is neutral and £ follows m until they lie in different elements of 
the circle partition. No relabeling of s and i. 

Case 1.2: 




s <iex m — £; ->C{s, m, £) 



m, £ 



Instruction: s is neutral and £ follows m until < p{m) — p{£) < p{s). We note 
that this is always possible since p{s) ^ and the set {{p{m) + na) mod 1 : n G N} 
is dense in the unit circle. Exchange the labels s <r^ £. 

Case 2.1: 




s <iex rn; -^C{s,m,£) 



Instruction: £ is neutral and s follows m until they lie in different elements of 
the circle partition. No relabeling of s and £. Three cases are possible according 
to the relative position of m and £ in the partition (1 — a, 1). 



Case 2.2: 




s = m; -'C{s,m,£) 



Instruction: £ is neutral and s follows m until p{m) = p{s) > p{£). This is 
possible because p{£) ^ 0. Exchange the labels s ^ £. 

Case 3.1: 




m <iex £', ^C{s, m, £) 



Instruction: s is neutral and £ follows m until they lie in different elements of 

the circle partition. No relabeling of s and £. Three cases are possible according 
to the relative position of m and s in the partition (0, 1 — a). 

Case 3.2: 




m = £; -'C{s,m,£) 



Instruction: s is neutral and £ follows m until < p{m) = p{£) < p{s). This is 
possible because p{s) ^ 0. Exchange the labels s ^ £. 

Case 4: 




m >iex^; p{s) > a; -^Pi{s,m,£) 



Instruction: s is neutral and £ follows m for just one rotation by a. Exchange 
the labels s £. Because p{s) > a, we have either m <iex s or m = s. 



Case 5: 

V /-^ {p{i)- p{m)) modi < a; ^P2{s,m,i) 

• s 

m 

Instruction: £ is neutral and s follows m for just one rotatfon by a. Exchange 
the labels s ^ £. Because {p{£) ~ p{'m)) mod 1 < a, we have either m >\cx ^ or 
m = £. 

Case 6.1: 




s <iox m <iox £] ^C{s, m, £) 



Instruction: £ is neutral and s follows m until they lie in different elements of 
the circle partition. No relabeling of s and £. 

Case 6.2: 




s — m <iex ~'C(s, m, £) 



Instruction: £ is neutral and s follows m until: p{m) — p{s) > p{£). This is pos- 
sible because p{£) ^ 0. Exchange the labels s £. 

Here we verify four of the ten cases in the proof of Theorem |4l The verifica- 
tions of all cases are similar to one another. 

Verification of Case 1.1: To see that Case 1.1 leads to Case 2.1, let m' and £' 

denote the positions of m and £ respectively, the first time they lie in different 
elements of the circle partition. Then clearly < p{s) < p{m') < 1 — a < p{£') as 
required. It remains to show that after the rotation -^C{s,m' ,£') holds. Suppose 
to the contrary that C{s,m',£') holds. Because p{m') > p{s), we must have 
p(m') = p{£') and p{s) = 0. But this implies p{m) = p{£) and p{s) — 0, which is 
impossible since we had assumed -iC{s^m^£). 

Verification of Case 1.2: To see that Case 1.2 leads to Case 1.1, let m' and 

£' denote the positions of m and £ respectively, the first time < p(m') = 



p{(.') < p{s). Then clearly after exchanging the labels s <^ t the points s,m, 
and I are situated as specified in Case 1.1. It remains to show that -'C{£' , m', s). 
Suppose to the contrary that C{£',m',s) holds. Because m' = we must have 
p(m') = p{£') and p(s) = 0. But this implies p{m) = p{i) and p{s) = 0, which is 
impossible since we had assumed -'C{s,m,£). 

Verification of Case 2.1: Let s' and m' denote the positions of s and m 
respectively, the first time they lie in different elements of the circle partition. 
Note that since we have assumed a < 1/2, it follows that p{s') > a for otherwise 
m and s would have already differed earlicir. Three cases are possible: m' <iox £, 
m' = £ OT m' >iex £■ We show that this leads to cases 3.1, 3.2 and 4 respectively. 
Assume first that m' <iex £■ To show that this leads to Case 3.1 or Case 3.2, we 
must verify that -^C {s' , m' , £) . However we have a < p{s') < 1 — a, and hence 
p(s') ^ 0. If p{m') — p{s'), then p{m) = p{s), and hence p{£) ^ since we had 
assumed ^C{s,m,£). Next we suppose that m' > £. To show that this results in 
Case 4, we must show that 

-^Pi{s',m',£). 

Assume to the contrary that Pi{s' ,m' ,£), that is, that p(s') = a, p{m') = and 
p{£) = 1 — a. This implies m = Om' and s = 0s', and hence p{m) = 1 — a = p{£) 
and p{s) = 0, which is impossible since we had assumed ^C{s,m,£). 

Verification of Case 4: Let £' and m' denote the positions of £ and m after 
rotation by a. Because p{s) > a, we have either m' <\cx s ov m' = s. We 
will show that this leads to cases 1.1 and 1.2 respectively. In view of the label 
exchange s o ^, the relative positions of the three points is as required. It remains 
to check in both cases that -^C{£' ,m' ,s) holds. Since p{s) > a, it follows that 
p{s) ^ 0. If p(m') = p{s), then we actually have p{m') = p{s) = a. This implies 
p{m) = 0. Because we had assumed -'Pi{s,m,£), we obtain p{£) ^ 1 — a, and 
hence p{£') ^ as required. 



